首页> 外文OA文献 >BioCaster: detecting public health rumors with a Web-based text mining system
【2h】

BioCaster: detecting public health rumors with a Web-based text mining system

机译:BioCaster:使用基于Web的文本挖掘系统检测公共卫生传闻

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Summary: BioCaster is an ontology-based text mining system for detecting and tracking the distribution of infectious disease outbreaks from linguistic signals on the Web. The system continuously analyzes documents reported from over 1700 RSS feeds, classifies them for topical relevance and plots them onto a Google map using geocoded information. The background knowledge for bridging the gap between Layman's terms and formal-coding systems is contained in the freely available BioCaster ontology which includes information in eight languages focused on the epidemiological role of pathogens as well as geographical locations with their latitudes/longitudes. The system consists of four main stages: topic classification, named entity recognition (NER), disease/location detection and event recognition. Higher order event analysis is used to detect more precisely specified warning signals that can then be notified to registered users via email alerts. Evaluation of the system for topic recognition and entity identification is conducted on a gold standard corpus of annotated news articles.
机译:简介:BioCaster是一个基于本体的文本挖掘系统,用于从Web上的语言信号中检测和跟踪传染病爆发的分布。该系统连续分析从1700多个RSS提要中报告的文档,对它们进行主题相关性分类,并使用地理编码信息将其绘制到Google地图上。可免费获得的BioCaster本体中包含弥合Layman术语与形式编码系统之间差距的背景知识,其中包括八种语言的信息,重点关注病原体的流行病学作用以及地理位置(经度/纬度)。该系统包括四个主要阶段:主题分类,命名实体识别(NER),疾病/位置检测和事件识别。高阶事件分析用于检测更精确指定的警告信号,然后可以通过电子邮件警报将其通知给注册用户。用于主题识别和实体识别的系统的评估在带有注释的新闻文章的黄金标准语料库上进行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号